Selectivity Estimation of Inequality Joins in Databases

نویسندگان

چکیده

Selectivity estimation refers to the ability of SQL query optimizer estimate size results a predicate in query. It is main calculation based on which can select least expensive plan execute. While problem has been known since mid-1970s, we were surprised that there are no solutions literature for selectivity inequality joins. By testing four common database systems: Oracle, SQL-Server, PostgreSQL, and MySQL, found open-source systems PostgreSQL MySQL lack this estimation. Oracle SQL-Server make fairly accurate estimations, yet their algorithms secret. This paper, thus, proposes an algorithm join The proposed was implemented sent as patch be included next releases. We compared implementation with above DBMS three different data distributions (uniform, normal, Zipfian) showed our provides extremely estimations (below 0.1% average error), outperforming other by order magnitude.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Selectivity Estimation for Spatial Joins

Spatial Joins are important and time consuming operations in spatial database management systems. It is crucial to be able to accurately estimate the performance of these operations so that one can derive efficient query execution plans, and even develop/refine data structures to improve their performance. While estimation techniques for analyzing the performance of other operations, such as ra...

متن کامل

Selectivity Estimation for Joins Using Systematic Sampling

We propose a new approach to the estimation of join selectivity. The technique, which we have called “systematic sampling”, is a novel variant of the sampling-based approach. Systematic sampling works as follows: Given a relation R of N tuples, with a join attribute that can be accessed in ascending/descending order via an index, if n is the number of tuples to be sampled from R, select a tuple...

متن کامل

Selectivity Estimation for Spatial Joins with Geometric Selections

Spatial join is an expensive operation that is commonly used in spatial database systems. In order to generate efficient query plans for the queries involving spatial join operations, it is crucial to obtain accurate selectivity estimates for these operations. In this paper we introduce a framework for estimating the selectivity of spatial joins constrained by geometric selections. The center p...

متن کامل

Selectivity Estimation in Extensible Databases - A Neural Network Approach

Extensible database systems allow users to create new data types representing spatial, text, image, and other multimedia objects, and functions which operate on the new types[S96]. The new data types and functions can be used as predicates in SQL queries. Consider an insurance industry application, built using an extensible DBMS that has user defined extensions for spatial, text, date/time, and...

متن کامل

Lightning Fast and Space Efficient Inequality Joins

Inequality joins, which join relational tables on inequality conditions, are used in various applications. While there have been a wide range of optimization methods for joins in database systems, from algorithms such as sort-merge join and band join, to various indices such as B-tree, R⇤-tree and Bitmap, inequality joins have received little attention and queries containing such joins are usua...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Mathematics

سال: 2023

ISSN: ['2227-7390']

DOI: https://doi.org/10.3390/math11061383